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Abstract 

Molecular mimicry defined as similar structures 
shared hy molecules from dissimilar genes or by their 
protein products, is a general strategy for pathogens 
to infect host cells. Therefore, identification of the 
molecular mimic regions of a pathogen may be helpful 
to understand the disease. Severe acute respiratory 
syndrome (SARS) is a new human respiratory 
infectious disease caused by SARS coronavirus 
(SARS-CoV). The virus uses the spike (S) protein to 
interact with the angiotensin converting enzyme 2, the 
host cell receptor. Our approach is to design a 
workflow with multiple bioinformatics tools in 
analyzing the sequence of spike protein of the SARS- 
CoV in searching its similarity to human proteins. 
Furthermore, eleven peptides have been synthesized to 
validate the in silico results. 


1. Introduction. 

Severe Acute Respiratory Syndrome” (SARS) is a 
new emerging infectious disease, which was first 
reported in China in 2002 [1, 2]. The disease is 
caused by a novel coronavirus (SARS-CoV) [3, 4, 5]. 
SARS-CoV is a positive-stranded RNA virus. The 
genome of SARS-CoV is around 29,727 nucleotides in 
length. The sequence was annotated in silico [6]. 
According to the genomic sequence of SARS-CoV, it 
is predicted that SARS-CoV produces several 
structural proteins including spike (S), envelop (E), 
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membrane (M), and nucleocapsid proteins. S protein 
is a key molecule for the initial viral entry into human 
cells. The majority of S protein (residues 12-1195) is 
outside the virus particle, which can be divided into 
amino-terminal SI and carboxyl-terminal S2 domain. 
The SI domain (residues 12-672) binds to the host cell 
receptor, angiotensin-converting enzyme 2 (ACE2) 
while the S2 domain is responsible for membrane 
fusion [7, 8, 9]. 

Molecular mimicry, which is defined as similar 
structures shared by molecules from dissimilar genes 
or by their protein products, is a general strategy for 
pathogens to infect host cells and has been proposed as 
a pathogenic mechanism for autoimmune disease [10]. 
Therefore, identification of the molecular mimic 
regions of pathogen may be helpful to understand the 
disease induced by that pathogen. At present, it is 
unclear whether molecular mimicry occurs between 
SARS-CoV S proteins and human peptides. We have 
approached this question by in silico analyzing the 
sequence of spike protein of SARS-CoV and select 
regions that share the sequence homology with human 
proteins. Synthetic peptides were synthesized to 
validate the prediction. 

2. Materials and Methods. 

2.1. Peptide prediction and synthesis 

Publically available human and coronavirus 
genome sequences at the National Center for 
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Biotechnology Information USA were used for in 
silico prediction. Immunogenic viral peptides were 
calculated based on the algorithm developed by 
Kolaskar and Tongaonkar [11]. In silico secondary 
structural analyses of spike protein were performed 
based on PHD [12] and PREDATOR [13] algorithms. 
Protein topology prediction was based on the 
algorithm developed by TMHMM [14]. 
Hydrophobicity of the peptides was calculated based 
on the algorithm HMOMENT [15]. Similarity 
searches between S protein and human genome 
database were performed by using BLASTP [16]. 
Extra amino acid residues were added at either N- or 
C- terminus to decrease the hydrophobicity. Multiple 
antigen peptides were synthesized by CytoMol Corp 
(Mountain View, CA, USA). 

2.2 Mice immunization 


golgi autoantigen, angrgm-52, and pallidin, 
respectively. Among these regions, region 3 has the 
highest score. 
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Six to eight-week-old female BALB/c mice were 
originally purchased from Jackson Laboratory (Bar 
Harbor, ME, USA) and bred in the Laboratory Animal 
Center, National Cheng Kung University. Synthetic 
peptides were emulsified with complete Freund’s 
adjuvant and injected intraperitoneally. Mice were 
boosted intraperitoneally two weeks after priming. 
Sera were collected from the axially plexus of the mice 
at different time intervals. 

2.3 SDS-PAGE and Western blot analysis 

Proteins in the cell lysate of A549 were separated by 
12% SDS-PAGE and transferred to nitrocellulose 
sheets. Proteins recognized by normal or peptide DO8 
hyperimmune mice sera were detected by using HRP- 
conjugated anti-mouse immuno- globulin antibodies 
and substrates. 

3. Results. 

3.1 Search for molecular mimic regions in S 
protein. 

The whole amino acid sequence of spike protein 
was analyzed to find out the potential immunogenic 
regions and the regions shared sequence homology 
with human proteins, which is defined as the 
pathogenic regions. As shown in Figure 1, there are 4 
pathogenic regions. Region 1 (residues 199-254), 
region 2 (residues 658-715), region 3 (residues 893- 
941), and region 4 (residues 1127-1184) have shared 
sequence homology with hydroxyacid oxidase, human 


Figure 1 In silico analysis of S protein. 

S protein amino acid sequence was analyzed to find 
immunogenic regions (yellow regions) and pathogenic 
regions (regions with shared sequence homology with 
human proteins, blue regions). Purple regions are both 
immunogenic and pathogenic regions. Grey region is the 
leader sequence and brown region is the transmembrane 
region. 

In addition, because des-Arg bradykinin and Ang I 
are the substrates for ACE2 [17], we also compared 
the sequence of S protein against bradykinin and Ang I 
and found that residues 490-502 
(GYQPYRVVVLSFEE) of S protein showed 
sequence homology with bradykinin (Figure 2). 

Spike protein GYQPYRVWLS 

| * * | \ | w w a * 

Bradykinin £ FSF FRSSRIG 

Figure 2 Sequence homology between S protein 

and bradykinin. 

Pair wise alignment was calculated based on Smith- 
Water local alignment with matrix set at EBLOSUM 45. 
“ | ”is annotated for identical residues; and are for 
similar residues. 

3.2 In silico predicted pathogenic peptides. 

Eleven antigenic peptides (Table 1), which 
represent those pathogenic regions were predicted and 
synthesized. 

Table 1. Sequence of the eleven synthetic peptides. 
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Extra amino acid residues which were indicated by italic 
letters were added at either N- or C-terminus to decrease the 
hydrophobicity. 


Peptide 

Amino acid positions 

Amino acid sequence 

No. of amino acids 

D01 

199-210 

GYQPIDVVRDLG 

12 

D02 

658-669 

ASYHTVSLLRST SQK 

15 

D03 

733-744 

EEGNLLLQYGSFCTQ 

15 

D04 

745-753 

EELNRALSGIAGQ 

13 

D05 

763-770 

VFAQVKQM 

8 

D06 

911-919 

KAISQIQES LTTE 

13 

D07 

927-937 

GLGKLQDVVNQNGE 

14 

D08 

942-951 

ALNTLVKQLSSN 

12 

D09 

1154-1162 

INASWNIQK 

10 

DIO 

490-502 

GYQPYRWVLSFEE 

14 

Dll 

306-317 

GFRWPSGDWRE 

13 


3.3 Hyperimmune sera against D08 cross- 
reacted with A549 cells 


To test whether synthetic peptides D01, D07 D08 
can induce antibodies cross-reacted with human 
proteins, we immunized mice with these peptides to 
generate hyperimmune sear against these peptides. 
Using Western blot analysis, hyperimmune sera 
against D08 could recognize more bands in A549 cell 
lysate as compared to normal mice sera (Figure 3). In 
addition, hyperimmune sera against D07 but not D01 
showed similar cross-reactivity to A549 cells as 
hyperimmune sera against DO8 did (data not shown). 



Normal DO 8 

Figure 3 Anti-D08 peptide sera recognizes protein 
from human A549 cell line. 

3.4 Hyperimmune sera against DIO cross- 
reacted with bradykinin 

To test whether the synthetic peptides DIO indeed 
can induce antibodies cross-reactive with bradykinin 
and Ang I, as predicted, we immunized mice with DIO 
peptides to generate hyperimmune sera against this 
peptide. Significant increase of antibodies against DIO 
was found in DIO hyperimmune sera, which could 


cross-react with bradykinin- but not with Ang I-coated 
plates (Figure 4). 




dilution 

Figure 4 The cross-reactivity of DIO antibody with 

bradykinin and Ang I. 

Hyperimmune sera from DIO immunized mice (■) or 
normal mice sera (A) were diluted as indicated and reacted 
with D10-, bradykinin- or Ang I-coated ELISA plates as 
indicated. Bound antibodies were detected as described in 
Materials and Methods. Data represents the mean ± SD of 
triplicates. 


4. Discussion. 

In this study we have identified four pathogenic 
regions of SARS-CoV S protein which share sequence 
homology with different human proteins. Among them, 
pathogenic region 3 (residues 893-941), which shares 
sequence homology with angrgm-52 (GenBank 
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accession no. AAL62340), a novel gene up-regulated 
in human mesangial cells stimulated by angiotensin II, 
may deserve further investigation. Peptides D07 and 
DO8 of this region were recognized by the sera of 
SARS patient indicating that this region is 
immunogenic and can be recognized by the immune 
system during SARS-CoV infection. Murine 
hyperimmune sera against peptides D07 or DO8 were 
able to bind to recombinant S2 but not S1 domain of S 
protein (data not shown). In addition, hyperimmune 
sera against D07 or DO8 also bound to the cytoplasmic 
region of A549 cells and recognized several proteins 
in the A549 cell lysate. These results indicate that 
regions represented by D07 and DO8 are immunogenic 
and may induce autoantibodies. However, further 
study is required to understand the biological function 
of these regions and the role of their antibodies in the 
pathogenesis of SARS-CoV infection. 

In addition to D07 and DO8 peptides, we also 
noticed DIO peptide which represents residues 490- 
502 of SI domain contained some interesting activities. 
The DIO peptide, which shared sequence homology 
with bradykinin, was able to generate antibodies cross¬ 
reactive with bradykinin. In addition, DIO peptide 
could stimulate A549 to produce IL-8 and 
proliferation as Ang I did. These results suggest that 
the region of DIO in S protein may bind to Ang I 
receptor, ACE2, and may be involved in the binding of 
SARS-CoV to ACE2. This is consistent with the 
previous report, which indicates that residues 318-510 
of SI domain can bind to ACE2 [18] and is similar to 
the receptor binding domain of the HCoV-229E, 
which is within a fragment containing residues 407 to 
547 [19]. Therefore, region 490-502 of SI domain 
may be involved in the receptor binding domain of 
SARS-CoV. 

In summary, our results suggest that molecular 
mimicry occurs between SARS-CoV and host proteins. 
Motifs shared sequence homology with host proteins 
of SARS-COV may be involved in the binding and 
fusion of SARS-CoV to host cells. Antibody against 
these motifs may contain neutralization activity against 
SARS-CoV infection or participate in the 
immunopathogenesis induced by SARS-CoV. Small 
molecules derived from molecular structure of these 
motifs may provide alternative approaches to disrupt 
the infection of SARS-CoV [20]. 
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